A statistical procedure for flagging weak spots greatly improves normalization and ratio estimates in microarray experiments
نویسندگان
چکیده
Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the signal intensities and the normalization of the experiment versus reference signal intensities. Here we describe and validate a new model for microarray signal intensity that has one multiplicative variation and one additive background variation. Using replicative experiments and simulated data, we found that the signal intensity is the most critical parameter that influences the performance of normalization, accuracy of ratio estimates, reproducibility, specificity and sensitivity of microarray experiments. Therefore, we developed a statistical procedure to flag spots with weak signal intensity based on the standard deviation (δ ij) of background differences between a spot and the neighboring spots, i.e. a spot is considered as too weak if the signal is weaker than cδ ij. Our studies suggest that normalization and ratio estimates were unacceptable when this threshold (c) is small. We further showed that when a reasonable compromise of c (c = 6) is applied, normalization using trimmed mean of log ratios performed slightly better than global intensity and mean of ratios. These studies suggest that decreasing the background noise is critical to improve the quality of microarray experiments. 3 The tremendous advance of the human genome project and development of new high-throughput technologies has created unparalleled opportunities to study the mechanism of disease, monitor disease progression and evaluate effective therapies. As more and more genes are being identified, it has become extremely important to understand the function of these genes and their pathways. Global gene expression analysis is a critical component of this ambitious endeavor. Microarray technologies offer investigators an opportunity to simultaneously monitor the expression of a large number of genes in the context of their biological system. For this study, we concentrated on microarray-based studies monitoring RNA expression levels using cDNA microarrays printed on glass microscope slides. Pat Brown and coworkers developed the protocols widely used to do this type of assay (1,2,3,4,5). The basic strategy for this type of analysis is to isolate RNA from two sources, a reference and an experimental sample. The RNA samples are converted to cDNA and labeled with a fluorophore, typically Cy3 …
منابع مشابه
A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays.
Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the si...
متن کاملSpatial Analysis of cDNA Microarray Experiments
Some normalization methods assume that the two channels of intensities are related by a constant so that Microarray experiments allow RNA level measurements for many the center of the distribution of the log of the ratio is genes in multiple samples. However, mining the biological information shifted to zero. Chen et al. (1997) proposed an iterative from the large sets of data generated by micr...
متن کاملUse of within-array replicate spots for assessing differential expression in microarray experiments
MOTIVATION Spotted arrays are often printed with probes in duplicate or triplicate, but current methods for assessing differential expression are not able to make full use of the resulting information. The usual practice is to average the duplicate or triplicate results for each probe before assessing differential expression. This results in the loss of valuable information about genewise varia...
متن کاملlimmaGUI: A graphical user interface for linear modeling of microarray data
SUMMARY limmaGUI is a graphical user interface (GUI) based on R-Tcl/Tk for the exploration and linear modeling of data from two-color spotted microarray experiments, especially the assessment of differential expression in complex experiments. limmaGUI provides an interface to the statistical methods of the limma package for R, and is itself implemented as an R package. The software provides poi...
متن کاملIdentifying Differentially Expressed Genes in cDNA Microarray Experiments
A major goal of microarray experiments is to determine which genes are differentially expressed between samples. Differential expression has been assessed by taking ratios of expression levels of different samples at a spot on the array and flagging spots (genes) where the magnitude of the fold difference exceeds some threshold. More recent work has attempted to incorporate the fact that the va...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001